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I.  INTRODUCTION 


Consider  a  random  sample  Xj,  Xj,  ...,  Xn  from  a  population  with  a  continuous 
distribution  function.  One  method  of  testing  the  hypothesis  that  the  n 
observations  come  from  a  population  with  a  specified  distribution  function  F(x)  is 
by  a  chi-square  test.  This  test  requires  a  subjective  partitioning  of  the  real  line  R 
and  a  comparision  of  the  empirical  histogram  with  the  hypothetical  histogram.  A 
more  objective  method,  is  to  compare  the  empirical  distribution  function  Fn(x) 
with  the  hypothetical  distribution  function  F(x).  The  empirical  distribution 

k 


function  based  on  n  observations  is  defined  as  F 
observations  are  less  than  or  equal  to  x  for  k  *-  0,  1,  ...,  n. 


i(x)  -  - 

n 


if  exactly  k 


To  compare  the  empirical  and  hypothetical  distribution  functions  a  measure 
of  their  difference  is  required.  Addressing  this,  Anderson  and  Darling  [1952] 
considered  the  following  metrics  in  function  space: 

wn2  -  n  7  |F„(x)  -  F(x)f  ^F(X)]  dF(x)  (1) 

—00 

and 


I<n  -  sup  Vn  lFn(x)  -  F(x)|V^[F(x)]  .  (2) 

-00  <  X  <  00 


Samples  producing  large  values  of  (or  K„)  lead  to  rejection  of  the  null 
hypothesis  that  the  population  distribution  function  is  F(x).  One  of  the 
contributions  of  Anderson  and  Darling  was  the  incorporation  of  a  non-negative 
weight  function  in  (1)  and  (2).  By  a  suitable  choice  for  specific  ranges  of 
values  of  the  random  variable  X,  corresponding  to  different  regions  of  the 
distribution  F(x),  may  be  emphasized.  For  F(x)]  s  1,  W|  becomes  the  Cram^r- 
von  Mises  statistic  [Cramer,  1928  and  von  Mises,  1931]  and  K,,  becomes  the 
Kolmogorov  statistic  [Kolmogorov,  1933]. 

The  tails  of  the  distribution  function  will  be  accentuated  in  the  investigation 
detailed  in  this  paper;  Anderson  and  Darling  suggest  using 

«X)]  “  F(x)[l  —  F(x)]  ' 

With  this  choice  for  the  weighting  function,  metric  (1)  becomes  the  basis  for  the 
Anderson-Darling  statistic. 
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In  Section  II,  the  Anderson-Darling  test  statistic  is  developed;  in  Section  III, 
the  most  accurate  tabulation  to  date  of  the  test  statistic  is  provided.  In  Section 
IV,  the  description  and  the  results  of  a  power  study  are  given  in  which  the 
Anderson-Darling,  the  Cram6r-von  Mises,  and  the  Kolmogorov  statistics  are 
compared. 


H.  THE  ANDERSON-DARLING  STATISTIC 

For  a  fixed  value  of  the  random  variable  X,  say  X  —  x,  the  empirical 
distribution  function  Fn(x)  is  a  statistic,  since  it  is  a  function  of  the  sample  values 
Xj,  x2,  xn.  The  distribution  of  this  statistic  is  established  as  a  lemma. 

Lemma  (1):  If  Fn(x)  is  the  empirical  distribution 
function  corresponding  to  a  random  sample  Xj,  X^,  ...,  X„ 
of  size  n  from  a  distribution  H(*),  then  for  a  fixed  x,  nFn(x) 
is  distributed  binomial  (H(x),n). 


Proof: 

P(nFn(x)  **  k)  —  P( exactly  k  values  xs  <  x),  for  k  ■*  0,  1,  ...,  n. 


Let  Zj 


!(-<»,  x)  (Xj),  where  the  indicator  function  I  is  defined  as 

[l,  if  — oo  <  X,  <  x 


Ij-OO.  >|  (X,) 


^  0,  otherwise  . 


Then  EZj  counts  the  number  of  sample  values  X;  <  x. 
Here  each  Zj  ^Bernoulli  (H(x)),  so  £Zj~binomial(H(x),n). 
Therefore, 

P(nF„(x)  —  k)  -*  P( exactly  k  values  x,  <  x) 
-P(EZj-k) 


v  t 


H(x)k  (l  -  H(x))"~k  . 
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From  Lemma  1, 

E[Fn(x)]-^E[nFn(x)]-H(x) 

and 

Var|F„(x)]  -  Var[nF„(x)]-  -i-  H(x)  [1  -H(x)]  .  (3) 

To  assist  in  the  determination  of  a  suitable  weighting  function  i^')>  that  is, 
a  function  that  will  weight  more  heavily  values  in  the  tails  of  the  distribution 
F(x)  at  the  expense  of  values  closer  to  the  median,  consider  the  expectation  of 
the  squared  discrepancy  E  Fn(x)  —  F(x)  .  It  is  important  to  keep  in  mind 

that  the  value  x  is  fixed,  so  F(x)  is  a  constant,  and  the  expectation  is  with 
respect  to  the  random  variable  Fn(x)  whose  distribution  was  established  in 
Lemma  1.  Then 

n  E  [f„(x)  -  F(x)]2  -  n  E  [f„(x)  -  H(x)  +  H(x)  -  F(x)]2 


n  E 


Fn(x)-H(x) 


-  F(x)-H(x) 


which,  after  algebraic  manipulation  (Appendix  A)  yields  the  variance  and  bias2 

>21 


n 


n 


H(x){l  -  H(x)} 


+ 


F(x)  -  H(x) 


(4) 


Under  the  null  hypothesis  H0:  H(x)  =  F(x)  Vx,  (4)  becomes 
n  E^Fn(x)  —  F(x)]2  —  F(x)[l  —  F(x)]  . 


(5) 


Anderson-Darling  chose  as  a  weighting  function,  $F(x)1  —  ■■  -  7 - —  —  . 

F(x)  [  1  —  F(x)j 
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Weighting  by  the  reciprocal  of  (5)  takes  into  consideration  the  variance  of  the 
statistic  Fn(x)  and  also  maintains  the  objective  of  accentuating  values  in  the 
tails  of  F(x). 


With  this  choice  of  weighting  function  and  without  loss  of  generality 
assuming  <  x2  <  ...  <  xn,  let  F(x)  —  u,  dF(x)  —  du,  and  F(xj)  —  Uj.  Then 
the  Anderson-Darling  test  statistic  (0)  can  be  rewritten  as  expression  (7)  by 
expansion  and  integration  (Appendix  B). 


7  FnM  -  F(x)]2 
"ioFM  (1  -F(x)] 


dF(x) , 


(6) 


\\  - - -  -  -f  S  f(2i— l)!n  u,  +  (2(n-i)+l)  ln(l-U|)|. 

"  1-1  L  J 


(7) 


III.  DISTRIBUTION  OF  THE  ANDERSON-DARLING  STATISTIC 

The  asymptotic  distribution  of  W“  was  derived  by  Anderson  and  Darling 
[1952].  Lewis  [1901]  undertook  the  tabulation  of  F(z;  n)  ■»  P(W„  <  z)  for 
n  =  1.  2,  ....  8  and  for  incremental  values  of  z  over  the  interval  [0.025,  8.000]. 
Lewis’  table  entries  were  computed  using  a  Monte  Carlo  procedure  to  generate 
an  empirical  approximation  Fm(z;n)  to  the  distribution  function  F(z;n)  based 
on  m  samples  of  size  n.  At  that  time,  computational  restrictions  essentially 
limited  the  accuracy  of  the  table  entries  to  within  0.00326  of  the  true  value. 

Following  an  analogous  procedure  based  on  expression  (7)  and  the 
observation  that  the  U,  are  distributed  U[0,1]  [Feller,  1966],  the  table  appearing 
in  Lewis’  paper  was  recalculated  using  a  Cray-2  supercomputer.  Table  1  lists 
the  reconstruction  of  Lewis’  table,  now  accurate  within  0.0005.  Again,  z  ranges 
from  0.025  to  8.000  and  for  n  »  1,  2,  ...,  10.  The  column  labeled  "oo"  contains 
the  asymptotic  values,  rounded  to  four  decimal  places. 

To  obtain  this  increased  accuracy,  a  Kolmogorov- type  bound  [Conover, 
1980]  was  used  to  construct  a  95%  confidence  band  for  the  distribution 
function  F(z;n).  In  general,  the  width  of  a  (1  —  a)100%  confidence  band  is 
equal  to  twice  the  value  of  the  (1  —  a)100%  quantile  of  the  Kolmogorov 

statistic  Km  «■  sup  Vm  lFm(x)  —  F(x)l  ,  where  m  is  the  number  of 

“OO  <  X  <  00 

Monte  Carlo  samples  of  size  n  used  in  the  construction  of  Fm(x).  With  n  fixed, 
the  95%  confidence  band  can  be  made  arbitrarily  small  by  a  suitable  choice  for 
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TABLE  1.  Quantiles  of  l'(z;n)  for  n  =  I  thru  lOnndoo. 


oc  tN  10 

i  d  d  icii-n^'CXCMirjr-oN^L'CN^r' 
ccccccccqccc^^r.HNNtMNrr^r,  r) 

dddddddddddddodddddddddd 


**5*^OC  ►-  r-i  rf  cccorNvor-or^u^ocQr^i^r- 
O  *  w  w  O  w  O  Q  C:  w  w  p  ^  *-  N  N  N  N  r,  re  r, 

ddddddddddddddddddddddod 


oo 

f-  (N  si  On 
C  r,  >7,  h 
c*  cQ  r^,  ro 


r^-  ro  »— ■  *-«  ^s 


IT)  ^  ^ 


UriC''«tQ'OCNr-*f^OO<N'OQ 

O'  d  iWiJOJC'-^sCXCM^cCC^T,  occr,  i?ix 
~  ii5555ccccccc^"'-'-M(NNN^^r, 


ss?, 


ooooooooooooo 


r-r^.  ^roC’—  r-r^.r^,  (vioc'w^^fC'^-r^'CCs 
ocr^C'  n  <**.  cc'occNar:  C'C^-.r^o 
p  —  rg-t  c  X  C  r,  IT,  oc  O  r,  IT)  oc  C  r^  ^  X 
CCCCCC-~*-^NMNN(?,  rr,  r, 


croc:c:c:orc:c:c:^c:c:ooaooooooooc:c 


p  —  X  «*“.  v*-,  O  —  vj  3*  x^i^^(N(N«C'T^'"CN 

d  C  w  rr  x^totx*- 

d  d  p  ^  c:  —  ci  rt-  ^  X  C  r,  ir,  x  C  r,  ir,  oc  C  r,  v,  x 

5*c55ccccc-'-^^nnn(N(7.  r^,  r.  r. 

d  d  d  d  d  d  d  d  d  d  d  d  d  o  d  d  d  d  d  d  d  d 


dd^Q^  —  C  t  c  t^.  ^  X  C  r.  O  x  ^  r,  IT,  x 

i*cccccccc--^^fNrJN(Nt^,r,  r,Pi 

dddddddddddddddddddddd 


<s^C'  —  'sC^r~' 

q  v~>  *—  V/S  ^  rs 

^  OC  -  r,  IT,  X 

N  R  r,  r,  c*-,  c*. 


C  C  ^  C  C  OC  M  r  O'  Tf  n  ^  X  X  tri  oc  t^,  N  ^ 

ddd^^~rl,T  C  X  r<-.  U-.  oc  O  r^.  c  oc  r,  O  X 
^55iorccc---^(SNNNr,  rt^r 


^  —  crcrcocrocrcooc^aooooooooooc: 

rjrr  —  —  ocxo^-NCO^tpr^^tocr^o 
*  d  d  d  d  P  —  v  t,  L-  r^.  fsj  r,  r^  r^,  J'  T,  m  x  t  v  ^  ^  f^,  h 

d  d  pppperC’—  oC^-r^.  p^^rrvCOC 

d  d  5  ^  J  J  J  5  c  o  c  c  c  -  -  rj  n  rs  tN  r,  r:  r,  r* 

d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d 


CCCCCC(NCC-^C-XO'f<lCC^^^'O^N 
JJir*55*fN,N^fr'NTtSHfs^f,iNO"nC'oa'N 

5555555cccccc--‘'-^MNNNr1r,  rr, 

d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d  d 


x  5c  x  5  5  5 

X  X  X  X  X  X 

d  d  d  d  d  5 


x^NCOvp^^-^'or^,^“p^so^-*(Npvc 
ccSxorrrsias^^-L-fNir^rNpor^ 
25  r^i  in  (X  —  i,'CaMvr^O'rNTpfs'C> 
oocc^-  —  '-^NNNNf^rinn 


dccddcdccdcpc»—  ’-,C'»fSc^C'4pr^r*">r"i 

ddddddddddddodddddddddddd 


x5SS2SgSg5SggggS§J2?58ffi!?ffiB 

iicifccccccccqC’-j’-'.iNNKi^r, 

ddddddddddddddddddddddddd 


§  £  £ 

po  co  co 


d»^.  d^.  S^S^.d^.  dS^d^S'RS^S'RS^S^? 

v  rj  i/,  r^dM*7rir'dr%,w-Jr^dNCor'd(N>7Sr^p<N»ot^*pr4 

C  C  C  -  -  ^  ^  f  N  (N  tN  O  O  O  O  ^  ^  t  't  IA  >A  “n  ^  o  O 

ddddddddddddddddddddddddd 


r4  <n  r-« 

•OIOIQ 
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0.6500  0.4814  0.4166  0.4143  0.40%  0.4069  0.4052  0.40.38  0.4035  0.4029  0.4024  0.3980 

0.6750  0.5007  0.4394  0.4357  0.4314  0.4286  0.4270  0.4256  0.4254  0.4248  0.4244  0.4199 

0.7000  0.5188  0.4618  0.4563  0.4524  0.4495  0.4481  0.4467  0.4464  0.4459  0.4456  0.4412 


TABLE  1.  continued 


8te  K 


v*jor^oc^tr-or*ir-,<tocr^f^oc^inQr^vcop^t^t(»t^^-rJooo 
—  ntn^Nx^NNOt^f^r'^wi^tri^i¥riiHo5iONocri5 
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2.1500  0.9102  0.9171  0.9195  0.9208  0.9214  0.9218  0.9221  0.9223  0.9226  0.9228  0.9239 

2.2000  0.9148  0.9218  0.9241  0.9254  0.9259  0,9264  0.9266  0.9268  0.9272  0.9273  0.9285 


TABLE  1.  continued 
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0.9797  0.9833  0.9843  0.9848  0.9851  0.9853  0.9854  0.9856  0.9857  0.9857  0.9863 

0.9807  0.9841  0.9851  0.9856  0.9859  0.9861  0.9862  0.9864  0.9865  0.9865  0.9870 

0.9816  0.9849  0.9859  0.9863  0.9867  0.9868  0,986.9  0.9871  0.9872  0.9872  0.9878 


TABLE  1.  continued 


8 


m.  The  commonly  tabled  [Miller.  1956]  asymptotic  approximation  for  the  95th 
quantile  is  1.358/Vm  .  However.  Harter  [1980]  suggests  using 

l 


1.358 


m  + 


-i  I 

2 


3.5 


where  r  —  (m  +  4) z 


(8) 


for  an  improved  approximation. 

Using  approximation  (8)  to  construct  a  95%  confidence  band  with  the  width 
not  exceeding  0.001.  the  value  for  m  must  be  at  least  7,375,881-  In  this 
simulation,  m  was  chosen  to  be  7.4  million1. 


IV.  POWER  STUDY 

The  power  of  the  Anderson-Darling  test  was  compared  with  two  other 
goodness-of-fit  procedures  based  on  the  empirical  distribution  function:  the 
Kolmogorov  and  the  Cramer-von  Mises  tests.  The  Kolmogorov  statistic 
introduced  in  Section  1  as  metric  (2)  with  weighting  function  1 1'  [F (x)]  =  1 
becomes 


Kn  =  sup  VnlF„(x) -F(x)l  .  (9) 

—  oc  <  x  %  oc 


For  an  ordered  sample  xt  <  x2  <  •  •  •  <  xn  and  F(x,)  =  u,.  1%  may  be  evaluated 
as  VnD  where  D  —  max)  D+.  D_  )  and 


D+  =  max 


D  =  max 
i 


I 


u, 


1  The  sample  values  were  obtained  via  a  linear  congruential  uniform  random  number  generator 
of  the  form  Xk+1  =  (aXk  +  c)mo(J  q  ,  where  a  =  273673163155g,  c  =  13g,  and  q  =  24  .  It  has 
the  properties  of  a  ’good’  random  number  generator  as  suggested  by  Rubinstein  [1981],  In 
addition,  several  subsets  of  the  random  numbers  generated  were  tested  for  autocorrelations 
with  lag  up  to  36  For  each  subset  and  each  lag.  the  autocorrelation  did  not  exceed  0.06  in 
absolute  value 
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The  Cramer-von  Mist  s  statistic,  defined  as 

W*  =  n  7  (F„(x)  -  F(x)j2  dF(x-) 

— oc 


(10) 

In  the  power  study,  two  cases  were  considered.  Case  1  corresponds  to  the 
situation  in  which  the  parameters  of  the  hypothesized  distribution  are 
completely  specified.  Case  2  corresponds  to  the  situation  in  which  the 
parameters  are  not  specified  and  must  be  estimated  from  the  sample  data. 

For  both  case  1  and  2.  the  null  hypothesis  is 

H0:  A  random  sample  X,.  Xo . Xn  comes  from  a  normal  population 

or 

H0:  F(x)  =  F0(x).  where  F0(x)  ~  N(/i.<r)  . 

As  alternative  hypotheses,  the  Cauchy,  double  exponential,  and  extreme  value 
distributions  were  chosen,  each  with  location  parameter  the  same  as  the  null 
hypothesis.  This  provided  a  heavy-tailed,  light-tailed,  and  skewed  distribution, 
respectively,  against  which  the  power  of  the  three  goodness-of-fit  tests  are 
compared. 

The  power  functions  do  not  exist  in  closed  form;  they  are  approximated 
empirically  via  a  Monte  Carlo  simulation.  To  determine  a  point  on  the  power 
curve,  a  large  number  of  samples  of  size  n  was  generated  from  a  specific 
distribution  serving  as  the  alternative  hypothesis.  The  number  of  times  that 
the  null  hypothesis  was  rejected  at  a  specific  level  of  significance  wras  recorded. 
The  ratio  of  the  number  of  rejections.  V,  to  the  total  number  of  samples 
generated.  N.  provides  an  estimate,  p  =  Y/N,  of  the  probability  of  rejecting 
the  null  hypothesis  when  it  should  be  rejected  (power).  The  value  p 
determines  a  point  on  the  power  curve  corresponding  to  a  specific  sample  size 
n,  a  specific  significance  level  a,  and  a  specific  alternative  hypothesis. 

To  determine  the  number  of  samples  of  size  n  required  for  a  sufficiently 
accurate  estimate  of  p,  a  nonparametric  technique  was  employed.  Since  the 
counter  Y  is  distributed  binomial (p.N)  where  the  parameter  p  is  the  true  but 
unknown  power,  and  since  an  approximate  confidence  interval  for  p  can  be 
constructed  [Conover.  1980]  using 


can  be  reduced  to  (10)  for  ease  of  computation  (Appendix  C); 

”  2i  —  1  2  ,  1 

VV  „  =  X  U.  —  -  +  -  . 

i-i  2n  12n 
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1  -Q  2s  P 


N 


Z(1  —  a/2) 


,  Y  Y 

[\u  -Y,/N 


o  _Y  [y  Y  H 

<P<  ..  +z(i-c/2)j  N(1  -  k)/N| 


N 


N 
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samples  of  size  n  continued  to  be  generated  from  the  alternative  distribution 
until  the  confidence  interval  for  p  given  in  (11)  was  sufficiently  small. 


The  confidence  interval  coefficient  1  —  a  was  chosen  to  be  0.975  and  the 
confidence  interval  width  not  to  exceed  0.025.  From  (11),  this  will  occur  when 


the  inequality  2  •  z0  gg7S 


<  0.025  is  satisfied.  For  anv  value  of 


X.  the  left-hand  side  of  the  inequality  is  maximum  when  Y/N  is  1/2. 
Substituting  Y/N  =  1/2  into  the  inequality  yields  N  <  8037  —  the  largest 
possible  number  of  sample1  required.  In  practice,  the  value  for  N  will  usually 
be  much  smaller  and  will  change  for  each  estimate  p.  This  dynamic  scheme 
was  chosen  rather  than  fixing  X  =  8037  for  the  entire  simulation.  A  minimum 
value  for  X  of  100  was  imposed  to  prevent  premature  termination  of  the 
procedure. 


1.  Case  1:  Distribution  Parameters  Specified. 

The  power  study  for  case  1  specified  the  parameters  of  the  hypothesized 
distribution  as  X(0.1).  The  results  of  the  study  are  summarized  in 
Figures  1  —  12.  For  each  of  the  three  distributions  serving  as  an  alternative 
hypothesis,  samples  of  size  n  =  5.  10.  15.  20  were  chosen  for  study  and.  as 
previously  mentioned,  the  location  parameters  of  both  the  null  and  alternative 
hypotheses  coincided.  The  scale  parameter  for  the  alternative  hypothesis 
ranged  from  0.025  to  3.000  in  increments  of  0.025. 

The  level  of  significance  for  the  study  wras  0.05.  The  critical  value  for  each 
test  was  determined  from  tables  in  Conover  [1980]  for  the  Kolmogorov  test. 
Stephens  and  Maag  [1968]  for  the  Cramer-von  Mises  test,  and  Table  1  in 
Section  III  of  this  paper  for  the  Anderson-Darling  test. 

The  Anderson-Darling  test  demonstrated  overall  superiority  for  the  sample 
sizes  and  hypotheses  chosen  for  this  study.  This  is  perhaps  to  be  anticipated  in 
view  of  the  emphasis  on  agreement  in  the  tails  by  the  Anderson-Darling 
procedure,  but  the  magnitude  of  difference  over  the  Kolmogorov  and  Cramer- 
von  Mises  tests  is  impressive. 

The  power  curves  corresponding  to  n  =  10.  15,  20  are  distinguished  by  their 
characteristic  of  decreasing  to  a  global  minimum  before  becoming 
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monotonicallv  increasing.  An  explanation  of  this  feature  is  suggested  by 
consideration  of  Figures  13  —  15  in  which  the  distribution  functions  of  the 
N(0,1)  and  Cauchy  (0,f)  are  compared.  There  it  is  seen  (Figure  14)  that 
corresponding  to  f  =  0.50  the  two  distribution  functions  are  similar;  an 
increase  (decrease)  in  the  scale  parameter  f  causes  the  tails  of  the  distributions 
to  become  more  distinct.  Values  in  a  neighborhood  of  f  cs  0.50  marked  the 
global  minimum  throughout  the  study. 


2.  Case  2:  Distribution  Parameters  Estimated. 

The  Anderson-Darling.  Kolmogorov,  and  Cramer-von  Mises  goodness-of-fit 
tests  w'ere  developed  for  use  in  case  1  where  distribution  parameters  are 
specified,  and  so  precludes  their  use  in  the  more  likely  situation  where 
parameters  must  be  estimated.  In  practice,  these  procedures  are  sometimes 
used  anyway  with  the  caveat  that  the  tests  are  likely  to  be  conservative. 
Stephens  [1974]  provides  adjustments  to  the  test  statistics  that  enables  the 
tests  to  be  used  to  test  the  assumption  H0:  F(x)  =  F0(x),  where  F0(x)  ~  N(//.cr) 
and  the  population  parameters  are  estimated  from  the  data. 

The  results  of  the  power  study  for  case  2,  are  summarized  in 
Figures  16  —  27.  As  in  case  1.  the  sample  sizes  are  n  =  5,  10,  15,  and  20,  and 
the  level  of  significance  is  0.05.  Both  location  and  scale  parameters  coincide: 
the  scale  parameter  are  values  from  0.025  to  3.000  in  increments  of  0.025. 

The  power  plots  are  horizontal,  demonstrating  that  power  does  not  change 
with  scale  parameter  and  provides  empirical  support  for  Stephens' 
transformations.  Power  increases  with  increasing  sample  size,  as  would  be 
expected.  When  n  =  5,  the  Anderson-Darling  test  was  the  least  powerful  of  all 
three  distributions  examined.  However,  none  of  the  distributions  had  power 
above  0.30  for  this  sample  size.  At  the  larger  sample  sizes,  the  Anderson- 
Darling  test  was  only  slightly  better  than  the  Kolmogorov  and  the  Cramer-von 
Mises  tests.  In  general,  when  both  location  and  scale  parameters  agree,  all 
three  tests  are  competitive  for  the  sample  sizes  and  alternative  distributions 
chosen  for  this  study. 
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Cauchy  vs  N(0,1) 


i9MOd 
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FIGURE  1.  Power  Plots  for  Cauchy(0,£)  vs  N(0, 1), 


Power  Plots  for  Double  Fxponential((),f)  vs  N(0, 1), 


Parameter 


Double  Exponential  vs  N(0,1) 


FIGURFI  7.  Power  Plots  for  Double  Hxponential(0,f)  vs  N(0, 1), 


Double  Exponential  vs  N(0,1) 


FIGURE  8.  Power  Plots  for  Double  Exponential(0, 


Extreme  Value  vs  N(0,1) 


FIGURE  9.  Power  Plots  for  Extreme  Value(O^)  vs  N(0, 1), 


Extreme  Value  vs  N(0,1 


FIGURF  10.  Power  Plots  for  Fxtremc  Vahic(0.^)  vs  N(( ),  1), 


Extreme  Value  vs  N(0,1) 


23 


Extreme  Value  vs  N(0,1) 


FIGURE  12.  Power  Plots  for  Extreme  Va 


FIGURE  13.  Cumulative  Distribution  Curves,  Normal(0,l)  &  Cauchy(0, 


Normal(0,l)  &  Cauchy(0,0.50) 


FIGURE  14.  Cumulative  Distribution  Curves,  Normal(0,1)  &  Cauchy(0, 


Normal(0,l)  &  Cauchy(0,l) 


FIGURE  15.  Cumulative  Distribution  Curves,  Normal(0,l)  &  Cauchy(0,l) 


FIGURE  17.  Power  Pic 


auchy 


FIGURE  18.  Power  Plots  for  Cauchy(0, 


Cauchy  vs  N(x, 


J3Mod 
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Double  Expo 


FIGURE  20.  Power  Plots  for  Double  Exponential^)  vs  N(x,s2), 


FIGURF,  21.  Power  Plots  for  Double  Fxponential(0,^)  vs  N(x,s2), 


Double  Exponential  vs  N(x, 


FIGURE  22.  Power  Plots  for  Double  Exponential(0,£)  vs  N(x,s  ), 


Double  Exponential  vs  N(x 


FIGURE  23.  Power  Plots  for  Double  Exponential^)  vs  N(x,s2), 


Extreme  Value  vs  N(x 


FIGURE  24.  Power  Plots  for  Extreme  Va 


Extreme  Value  vs  N(x 


FIGURE  26.  Power  Plots  for  Extreme  Value(0,f)  vs  N(x,s2), 


FIGURE  27.  Power  Plots  for  Extreme  Value(0,f)  vs  N(x,s  ) 
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APPENDIX  At  EXPECTATION  OF  SQUARED  DISCREPANCY 
BETWEEN  AN  EMPIRICAL  DISTRIBUTION  FUNCTION 
AND  A  SPECIFIED  DISTRIBUTION  FUNCTION 


E  [fb(x)  -  F(x)]2  -  E  [fb(x)  -  H(x)  +  H(x)  -  F(x)]J 


F„(x)-H(x) 


-  F(x)-H(x) 


Fn(x)-H(x) 


-2 


F„(x)  -  H(x) 


F(x)-H(x) 


+  F(x)-H(x) 


E  [f„(x)  -  H(x)]2  -  2  '  F(x)  -  H(x)  E  [f„(x)  -  H(x)] 


^  2 


+  F(x)-H(x) 
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For  an  ordered  sample  Xj  <  x2  <  •  •  •  <  xn  the  empirical  distribution 
function  is  defined  as 
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